Naı̈ve-Bayes vs. Rule-Learning in Classification of Email

نویسنده

  • Jefferson Provost
چکیده

Recent growth in the use of email for communication and the corresponding growth in the volume of email received has made automatic processing of email desirable. Two learning methods, naı̈ve bayesian learning with bag-valued features and the RIPPER rule-learning algorithm have shown promise in other text categorization tasks. I present three experiments in automatic mail foldering and spam filtering, showing that naı̈ve bayes outperforms RIPPER in classification accuracy.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Integrating Naïve Bayes and FOIL

A novel relational learning approach that tightly integrates the naı̈ve Bayes learning scheme with the inductive logic programming rule-learner FOIL is presented. In contrast to previous combinations that have employed naı̈ve Bayes only for post-processing the rule sets, the presented approach employs the naı̈ve Bayes criterion to guide its search directly. The proposed technique is implemented in...

متن کامل

Integrating Naı̈ve Bayes and FOIL ∗

A novel relational learning approach that tightly integrates the naı̈ve Bayes learning scheme with the inductive logic programming rule-learner FOIL is presented. In contrast to previous combinations that have employed naı̈ve Bayes only for post-processing the rule sets, the presented approach employs the naı̈ve Bayes criterion to guide its search directly. The proposed technique is implemented in...

متن کامل

nFOIL: Integrating Naı̈ve Bayes and FOIL

We present the system nFOIL. It tightly integrates the naı̈ve Bayes learning scheme with the inductive logic programming rule-learner FOIL. In contrast to previous combinations, which have employed naı̈ve Bayes only for post-processing the rule sets, nFOIL employs the naı̈ve Bayes criterion to directly guide its search. Experimental evidence shows that nFOIL performs better than both its base line...

متن کامل

Not So Naı̈ve Online Bayesian Spam Filter

Spam filtering, as a key problem in electronic communication, has drawn significant attention due to increasingly huge amounts of junk email on the Internet. Content-based filtering is one reliable method in combating with spammers’ changing tactics. Naı̈ve Bayes (NB) is one of the earliest content-based machine learning methods both in theory and practice in combating with spammers, which is ea...

متن کامل

nFOIL: Integrating Naïve Bayes and FOIL

We present the system nFOIL. It tightly integrates the naı̈ve Bayes learning scheme with the inductive logic programming rule-learner FOIL. In contrast to previous combinations, which have employed naı̈ve Bayes only for post-processing the rule sets, nFOIL employs the naı̈ve Bayes criterion to directly guide its search. Experimental evidence shows that nFOIL performs better than both its base line...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999